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Abstract 

Background: In the study of biomolecular structures and interactions the polar hydrogen-n bonds (Hp-n) are an 
extensive molecular interaction type. In proteins 11 of 20 natural amino acids and in DNA (or RNA) all four nucleic 
acids are involved in this type interaction. 

Results: The Hp-n in proteins are studied using high level QM method CCSD/6-31 1 +G(d,p) + H-Bq (ghost 
hydrogen basis functions) in vacuum and in solutions (water, acetonitrile, and cyclohexane). Three quantum 
chemical methods (B3LYP, CCSD, and CCSD(T)) and three basis sets (6-31 1 +G(d,p), TZVP, and cc-pVTZ) are 
compared. The Hp-n donors include R 2 NH, RNH 2 , ROH, and C 6 H 5 OH; and the acceptors are aromatic amino acids, 
peptide bond unit, and small conjugate n-groups. The Hp-n interaction energies of four amino acid pairs (Ser-Phe, 
Lys-Phe, His-Phe, and Tyr-Phe) are quantitatively calculated. 

Conclusions: Five conclusion points are abstracted from the calculation results. (1) The common DFT method 
B3LYP fails in describing the Hp-n interactions. On the other hand, CCSD/6-31 1 + G(d,p) plus ghost atom H-Bq can 
yield better results, very close to the state-of-the-art method CCSD(D/cc-pVTZ. (2) The Hp-n interactions are point 
to n-plane interactions, possessing much more interaction conformations and broader energy range than other 
interaction types, such as common hydrogen bond and electrostatic interactions. (3) In proteins the Hp-n 
interaction energies are in the range 10 to 30 kJ/mol, comparable or even larger than common hydrogen bond 
interactions. (4) The bond length of Hp-n interactions are in the region from 2.30 to 3.00 A at the perpendicular 
direction to the n-plane, much longer than the common hydrogen bonds (-1.9 A). (5) Like common hydrogen 
bond interactions, the Hp-n interactions are less affected by solvation effects. 

Keywords: Protein structures, Molecular interactions, Hydrogen-n interactions, Protein backbones, CCSD, Ghost 
atom 



Background 

The structures of proteins and other biological mole- 
cules are determined by the delicate balance between 
several molecular interactions [1-3]. Among them 
hydrogen-n interactions [4-7] are an extensive inter- 
action type in organic and biological molecules, referring 
to the interactions between hydrogen atoms, attaching 
to different atomic groups, and aromatic molecules or 
n-groups. The hydrogen-n interactions can be classified 
into two groups, the nonpolar hydrogen-n interactions 
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(H-n or CH-n) [8-11] and the polar hydrogen-n interac- 
tions (Hp-n) [12-16]. The typical nonpolar hydrogen-n in- 
teractions are the interactions between hydrogen atoms, 
attaching to carbon atoms, and the conjugate n-systems, 
often indicated by the notation CH-n in references [8,11]. 
The interaction strength and the physical nature and 
properties of nonpolar H-n interactions are often a debat- 
able topic [6,10,17]. In the polar hydrogen-n interactions 
the donors are the polar hydrogen atoms, attaching to 
electronegative atoms (R 2 NH, RNH 2 , and ROH), and the 
acceptors are various aromatic molecules and conjugate 
n-groups [18-20]. The interaction energies of polar 
hydrogen-n interactions are much stronger than that of 
nonpolar hydrogen-n interactions (CH-n), comparable or 
even larger than common hydrogen bonds. In this study 
the notation Hp-n is used for the polar hydrogen-n 
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interactions, to make difference from the nonpolar 
hydrogen-n interactions (H-tt or CH-tt), and the common 
hydrogen bond interactions (H-b) [21-25]. 

The Hp-n interactions frequently happen in biological 
macromolecules, such as proteins and DNA (or RNA), 
and play important roles in protein structures and bio- 
logical functions. In proteins the Hp-n interaction do- 
nors are the polar hydrogen atoms in atomic groups 
[12-16] -NH 2 , >NH, -OH, -SH, and C 6 H 5 OH, which 
exist in amino acids Ser, Thr, Tyr, Trp, Cys, His, Asn, 
Gin, Lys, and Arg, while the acceptors are the aromatic 
amino acids, Phe, Tyr, Trp, and His [19,20]. Three amino 
acids play the roles of both donor and acceptor (His, 
Tyr, and Trp). In the 20 natural amino acids 11 of them 
may be involved in the Hp-n interactions. The peptide 
bond units are a quasi n-group, comprising N, C, and O, 
which can play the roles of both Hp-n interaction ac- 
ceptor and donor [6]. In DAN and RNA the four nucleic 
acid components (adenine, guanine, cytosine, and thy- 
mine) possess aromatic rings and polar hydrogen 
groups, which are the Hp-n interaction acceptors and 
donors [23]. 

The Hp-n interactions are a unique interaction type 
that cannot be classified into other molecular interaction 
types, such as common hydrogen bond [21-25], cation-n 
interaction [26-29], electrostatic interaction, and van der 
Waals interaction. The physical nature and the inter- 
action strength of Hp-n interactions are often a debat- 
able topic. In the Hp-n interactions electron dispersion 
energy is one of the main contributions. Different 
quantum chemical methods may give very different 
interaction energies and descriptions for the interaction 
properties of Hp-n interactions. One reason is that some 
quantum chemical methods and basis sets may fail in es- 
timating the dispersion energies accurately [30]. In this 
study several quantum chemical methods and basis sets 
are evaluated and compared in the Hp-n interaction cal- 
culations. The Hp-n interaction contributors, investigated 
in this study, include the interaction donors R 2 NH, RNH 2 , 
ROH, C 6 H 5 OH, and peptide bond unit NMA (w-methyl 
acetamide); and the interaction acceptors include aromatic 
molecular benzene, hetero- aromatic molecules C 5 H 5 N 
and imidazole, peptide bond unit, carbonyl group, and 
other small conjugate n-groups. 

Results 

In this section all calculation results are summarized 
and reported using tables and figures. Brief comparisons 
and illustrations are provided following the results. 

Hp-n interactions of small conjugate n-groups 

In this section the Hp-n interactions between small do- 
nors and acceptors are used to illustrate the Hp-n inter- 
action properties and strength, including three Hp-n 



interaction pairs CH 3 OH-H 2 CO, NMA-C 2 H 4 , and 
CH 3 OH-C 2 H 4 . The Hp-n interaction structures of three 
Hp-n interaction pairs are shown in Figure 1, and the Hp-n 
interaction energies are listed in Table 1. In Figure 1 the 
small blue circle are 'ghost' hydrogen atoms, which improve 
the Hp-n QM calculations effectively. The role of 'ghost' 
atoms will be introduced in detail in the Method section. 
The HOMOs (highest occupied molecular orbitals) of three 
Hp-n interaction pairs are shown in Figure 1, in which the 
polar hydrogen atoms are in close touching with the 




Figure 1 The Hp-n interaction structures and HOMOs of three 

small interaction pairs. (A) Structure of Hp-n interaction pair 

CH 3 OH-H 2 CO. The polar hydrogen atom of CH 3 0H points to the 

carbon atom of H 2 C0 perpendicularly. (B) The HOMO of Hp-n 

interaction pair CH 3 OH-H 2 CO. The polar hydrogen atom is in close 

touching with the n-orbital of H 2 C0. (C) Structure of Hp-n 

interaction pair NMA-C 2 H 4 . The polar hydrogen atom of NMA (n- 

methyl acetamide) points to the center of double bond. (D) The 

HOMO of Hp-n interaction pair NMA-C 2 H 4 . (E) Structure of Hp-n 

interaction pair CH 3 OH-C 2 H 4 . The polar hydrogen atom of CH 3 0H 

points to the center of double bond perpendicularly. (F) The HOMO 

of Hp-n interaction pair CH 3 OH-C 2 H 4 . The ghost hydrogen atom H- 

Bq is attached to the polar hydrogen atoms, and the distance to 

polar hydrogen is 0.90 A. 
k J 
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Table 1 The polar hydrogen-n (Hp-n) interactions between three small molecular interaction pairs (CH 3 OH-H 2 CO, 
NMA-C 2 H 4 , and CH 3 OH-C 2 H 4 ) 





CH 3 0H- 


-H 2 CO 


NMA-C 2 H 4 




CH 3 OH-C 2 H 4 


Vacuum 


C 


0 


C 


Bond 


C Bond 


a Energy 


-3.6370 


-11.0838 


-11.2956 


-12.8848 


-11.3071 -12.8871 


b Bond 


2.6856 


2.3277 


2.6500 


2. 5943 


2.6453 2.5354 


c Solvent 


Acting position 


Water (£ = 78.39) 


Acetonitrile [e = 


35.9) 


Cyclohexane (e = 2.0) 


CH 3 OH-CH 2 0 


Oxygen 


-2.3002 


-2.5012 




-6.7355 


CH 3 OH-C 2 H 4 


Bond center 


-6.8966 


-6.9797 




-8.8056 



3 Interaction energies are in kJ/mol, calculated using CCSD/6-311 +G(d,p) plus ghost atom H-Bq. 
3 Bond lengths are in angstrom (A). 

: Energies in solutions are calculated using CCSD + PCM method. 



n-orbitals of the aromatic molecules. In other words, in 
the Hp-n interactions the polar hydrogen atoms are bur- 
ied in the n-electron density of aromatic molecules, simi- 
lar to the common hydrogen bond interactions. 

Unlike common hydrogen bond interactions that are 
point-to-point interactions, the Hp-n interactions are 
point to n-group interactions, in which the interactions 
could happen at any position of the n-plane. In Table 1 
for each Hp-n interaction pair two or more interaction 
positions are reported. In the NMA-C 2 H 4 interaction 
pair the Hp-n energy at the bond center (-12.885 kj/ 
mol) is higher than that (-11.296 kJ/mol) at the carbon 
atom of C 2 H 4 . The structural conformations of Hp-n in- 
teractions are much more than that of the common 
hydrogen bond, and the Hp-n interaction energies could 
change in a broad range. The Hp-n interaction energies 
of all three small interaction pairs are larger than half of 
H 2 0-H 2 0 hydrogen bond. The NMA (w-methyl aceta- 
mide) [31-33] is used as the model of peptide bond 
units. The Hp-n energy of NMA may represent the 
Hp-n interactions between the protein peptide back- 
bones and aromatic side chains. The lower part of 
Table 1 is the Hp-n interactions in three solutions 
(water, acetonitrile, and cyclohexane) using CCSD and 
PCM method [34-37]. The Hp-n energies in solutions 
are decrease with the increase of the solvent dielectric 
constants 8. However, the Hp-n energies in aqueous 
solution are still large. At this point the Hp-n interac- 
tions are like common hydrogen bond interaction, lees 
affected by solvation effects. 

The Hp-n interaction energies as the functions of 
interaction distances (R) are shown in Figure 2. For 
comparison the curve of NMA-NMA hydrogen bond 
interaction is also shown in Figure 2, which is the most 
frequent hydrogen bond in proteins. The bong lengths 
(-2.5 A) of Hp-n interactions are longer than that of the 
hydrogen bonds (-2.0 A). The force constants (k) of two 
Hp-n interaction pairs CH 3 OH-C 2 H 4 and CH 3 OH- 
C 6 H 6 are 0.0035 and 0.0055 Hartree/Bohr, smaller that 
(0.0071 Hartree/Bohr) of the NMA-NMA hydrogen 
bond interaction. The Hp-n interactions are more soften 



than the common hydrogen bond interactions in the 
minimum and short interaction distances. 

Hp-n interactions in aromatic molecules 

The aromatic molecules are the best H-n interaction ac- 
ceptors. In this section the Hp-n interactions of two aro- 
matic molecules are studied. One is the typical aromatic 
molecule benzene, and the other is a heteroatom- 
aromatic molecule C 5 H 5 N. The interaction structures of 
three interaction pairs (CH 3 OH-C 6 H 6 , CH 3 OH-C 5 H 5 N, 
and NMA-C 6 H 6 ) are shown in Figure 3, and the inter- 
action energies and bond lengths are summarized in 
Table 2. 

The H-n interaction energies of benzene are remarkably 
larger than that of the small Hp-n interaction acceptors 
(C 2 H 2 and H 2 CO) listed in Table 1. Usually the H-n inter- 
action energies increase with the size of aromatic mole- 
cules. The Hp-n interaction energy (-14.853 kJ/mol) of 
heteroatom aromatic molecule (C 5 H 5 N) is smaller than 
the pure aromatic molecule C 6 H 6 (-22.233 kJ/mol). The 



30 
25 
20 
15 
10 



-10 
-15 
-20 
-25 





CH30H-C2H4, Hp-Pi 




■*- CH30H-C6H6, Hp-Pi 
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Figure 2 The Hp-n interaction energies of CH 3 OH-C 2 H 4 (blue 
diamond) and CH 3 OH-C 6 H 6 (green square) pairs as the function 
of interaction distance (R). For comparison the curve of NMA-NMA 
hydrogen bond interaction is also shown (orange triangles), which is 
the most frequent hydrogen bond in proteins. The bong lengths 
(-2.5 A) of Hp-n interactions are longer than that of the hydrogen 
bonds (-2.0 A). The force constants (k) of two Hp-n interaction pairs 
CH3OH-C2H4 and CH 3 OH-C 6 H 6 are 0.0035 and 0.0055 Hartree/Bohr, 
smaller that (0.0071 Hartree/Bohr) of the NMA-NMA hydrogen bond 
interaction. The Hp-n interactions are more soften than the common 
hydrogen bond interactions in the minimum and short 
interaction distances. 
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Figure 3 The Hp-n interaction structures and HOMOs of three 
aromatic interaction pairs. (A) Structure of Hp-n interaction pair 
CH 3 OH-C 6 H 6 . The polar hydrogen atom of CH 3 OH points to the 
center of C 6 H 6 perpendicularly. (B) The HOMO of Hp-n interaction 
pair CH 3 OH-C 6 H 6 . The polar hydrogen atom is in close touching 
with the n-orbital of C 6 H 6 . (C) Structure of Hp-n interaction pair 
CH3OH-C5H5N. The polar hydrogen atom of CH 3 OH points to the N 
of C5H5N. (D) The HOMO of Hp-n interaction pair CH 3 OH-C 5 H 5 N. 
(E) Structure of Hp-n interaction pair NMA-C 6 H 6 . The polar hydrogen 
atom of NMA (n-methyl acetamide) points to the center of C 6 H 5 
perpendicularly. (F) The HOMO of Hp-n interaction pair NMA-C 6 H 6 . 



Table 2 The interaction energies and bond lengths of 
three aromatic Hp-n interaction pairs ((CH 3 ) 2 NH-C 6 H 6 , 
CH3OH-C5H5N, and NMA-C 6 H 6 ) 





(CH 3 ) 2 NH-C 6 H 6 


CH 3 OH-C 5 H 5 N 


NMA-C 6 H 6 


Position 


Center 


N 


Center 


a Energy 


-18.1457 


-14.8525 


-22.2329 


b Bond 


2.4100 


2.4500 


2.5850 



a Interaction energies are in kJ/mol, calculated using CCSD/6-311 +G(d,p) plus 
ghost atom H-Bq. 

b Bond lengths are in angstrom (A). 



Hp-n interaction energy of NMA-C 6 H 6 (-22.233 kJ/mol) 
is comparable to the H 2 0-H 2 0 hydrogen bond energy 
(-17 — 23 kJ/mol) [38]. This value could represent the 
Hp-n interaction energies between peptide bond units 
and the aromatic amino acids in proteins. 

Hp-n interactions in amino acids 

In the 20 natural amino acids 4 of them possess aro- 
matic side chains (Phe, Tyr, Trp, and His), which are the 
possible Hp-n interaction acceptors. On the other hand, 
10 (Ser, Thr, Asn, Gin, Cys, Tyr, Trp, His, Lys, and Arg) 
amino acids possess various Hp-n donors. The atomic 
group -OH is the Hp-n donor in amino acids Ser, Thr, 
and Tyr. The atomic group > NH or -NH 2 is the Hp-n 
donor of amino acids Asn, Gin, Lys, Arg, Trp, and His. 
Amino acid Cys has the Hp-n donor -SH. Three amino 
acids (Tyr, Trp, and His) play the roles of both Hp-n 
donor and acceptor. In proteins total 11 amino acids are 
involved in the Hp-n interactions. The Hp-n interactions 
of amino acids in proteins are an interesting and import- 
ant research topic. The amino acid Hp-n donors and ac- 
ceptors are shown in Figure 4, in which the Hp-n 
donors are indicated using blue cycles, and the Hp-n ac- 
ceptors are indicated using red cycles. 

In this section the Hp-n interactions of 4 amino acid 
pairs (Ser-Phe, Lys-Phe, Tyr-Phe, and His-Phe) are 
studied and reported. The energies and bond lengths of 
the four amino acid Hp-n interaction pairs are listed in 
Table 3, and the interaction geometries are shown in 
Figure 5. In the calculations the aromatic amino acid 
Phe is simplified as benzene C 6 H 6 , which is the Hp-n ac- 
ceptor in the four interaction pairs. The four Hp-n do- 
nors Ser, Lys, Tyr, and His are simplified as CH 3 OH, 
CH3NH2, C 6 H 5 OH, and imidazole, respectively, as 
shown in Figure 5. Except the Lys-Phe pair, the Hp-n 
interaction energies of other three amino acid pairs are 
close or even larger than the H 2 0-H 2 0 hydrogen bond 
energy. In Table 3 the bond length 3.550 A of Lys-Phe is 
from N of Lys to the benzene ring center of Phe, and 
3.093 A is the distance from a polar hydrogen atom of Lys 
to a carbon atom of Phe. The smaller Hp-n interaction en- 
ergy (-8.766 kJ/mol) of CH 3 NH 2 -C 6 H 6 pair may indicate 
that the RNH 2 is a poorer Hp-n interaction donor than 
the R 2 NH (-18.15 kJ/mol of (CH 3 ) 2 NH-C 6 H 6 ). 

Discussion 

In very recently publications [20] the Hp-n interactions 
were experimentally and theoretically studied by Kumar 
and Dasa using resonant two photon ionization (R2PI), 
IR-UV, and UV-UV double resonance spectroscopic 
techniques, and quantum chemical calculations. In their 
experiments N-H...n hydrogen bonds, slanted T-shaped 
structures, were observed in molecular dimer. The 



Du et al. Chemistry Central Journal 2013, 7:92 
http://journal.chemistrycentral.eom/content/7/1/92 



Page 5 of 8 




Tyr(Y) Trp(\V) His(H) Lys(K) Arg(R) 



Hp-7i acceptors 




Tyr(Y) Trp(W) His(H) Phe(F) 

Figure 4 The Hp-n donors and acceptors of natural amino 
acids. In the 20 natural amino acids 4 of them possess aromatic side 
chains (Phe, Tyr, Trp, and His), which are the possible Hp-n 
interaction acceptors. On the other hand, 10 amino acids (Ser, Thr, 
Asn, Gin, Cys, Tyr, Trp, His, Lys, and Arg) possess various Hp-n 
donors. The atomic group -OH is the Hp-n donor in Ser, Thr, and 
Tyr. The atomic group > NH or -NH 2 is the Hp-n donor of Asn, Gin, 
Lys, Arg, Trp, and His. Amino acid Cys has the donor -SH. Three 
amino acids (Tyr, Trp, and His) play the roles of both Hp-n donor 
and acceptor. In proteins total 1 1 amino acids may be involved in 
the Hp-iT interactions. The Hp-n donors are indicated using blue 
cycles, and the Hp-n acceptors are indicated using red cycles. 



Table 3 The interaction energies and bond lengths of 
four amino acid Hp-n interaction pairs (Ser-Phe, Lys-Phe, 
Tyr-Phe, and His-Phe) 





(Ser-Phe) 
CH 3 OH-C 6 H 6 


(Lys-Phe) 
CH 3 NH 2 -C 6 H 6 


(Tyr-Phe) 
C 6 H 5 OH-C 6 H 6 


(His-Phe) 
lmid-C 6 H 6 


Position 


Center 


Center 


Center 


Center 


a Energy 


-19.7648 


-8.7661 


-21.7831 


-25.771 


b Bond 


2.410 


c 3.550, 3.093 


2.425 


2.315 


length 











a Interaction energies are in kJ/mol, calculated using CCSD/6-311 +G(d,p) plus 
ghost atom H-Bq. 

b Bond lengths are in angstrom (A) from polar hydrogen to the center of 
benzene ring. 

c The Bond length 3.550 A is from N of Lys to the benzene ring center of Phe, 
and 3.093 A is the distance from a polar hydrogen atom of Lys to a carbon 
atom of Phe. 




Figure 5 The Hp-n interaction structures and HOMOs of three 
amino acid interaction pairs. (A) Structure of Hp-n interaction pair 
CH 3 NH 2 -C 6 H 6 . The two polar hydrogen atoms of CH 3 NH 2 point to 
the benzene ring perpendicularly. (B) The HOMO of Hp-n interaction 
pair CH 3 NH 2 -C 6 H 6 . The polar hydrogen atoms are in close touching 
with the n-orbital of C 6 H 6 . (C) Structure of Hp-n interaction pair 
C 6 H 5 OH-C 6 H 6 . The polar hydrogen atom of C 6 H 5 OH points to the 
center of C 5 H 6 . (D) The HOMO of Hp-n interaction pair C 6 H 5 OH- 
C 6 H 6 . (E) Structure of Hp-n interaction pair imidazole-C 6 H 6 . The polar 
hydrogen atom of imidazole points to the center of C 6 H 6 
perpendicularly. (F) The HOMO of Hp-n interaction 
pair imidazole-C 6 H 6 . 

^ J 

experimental observations could be the evidence of Hp- 
tt bonds in molecular interactions. 

The polar hydrogen-n (Hp-n) interactions are very dif- 
ferent from the non polar hydrogen-n (H-tt or CH-tt) in- 
teractions in interaction strength, and in physical nature 
and properties. The main physical contributions in Hp-n 
interactions are the electrostatic interactions, MO co- 
ordinating, and electron dispersion interaction. The 
Hp-n interactions are distance and orientation 
dependent, and the best orientation is the perpendicu- 
lar direction from polar hydrogen atom to the n-plane. 
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The interaction energies of polar hydrogen-n interac- 
tions (Hp-n) are much stronger than that of the dis- 
puted non polar hydrogen-n interactions (CH-tt) [8,9]. 

The three dimensional structures of proteins are not 
rigid constructions, but dynamically flexible. The Hp-n 
interactions are point to n-plane interactions, possessing 
much more interaction conformations and broader en- 
ergy range than that of the common hydrogen bond in- 
teractions. The Hp-n interactions may be responsible for 
the flexibility and dynamic activity of proteins. In bio- 
logical molecules the polar hydrogen atoms are the com- 
mon interaction donors of both hydrogen bond 
interactions and Hp-n interactions. These two interaction 
types happen at different directions. The most favorable 
orientation (perpendicular direction) for the Hp-n interac- 
tions is just the most unfavorable direction for common 
hydrogen bond interactions. Consequently we can expect 
that the common hydrogen bond (H-b) interactions and 
the polar hydrogen-n (Hp-n) interactions are the two 
main interaction forces supporting the three dimensional 
structures of proteins, and playing their roles in different 
directions. 

Based on our calculations, the Hp-n energies in solu- 
tions are decrease with the increase of the solvent dielec- 
tric constants 8. In this study the Hp-n interaction 
energies in solutions are calculated using PCM (Polarized 
Continuum Model) method. The PCM is a continuing 
medium model, which cannot give very accurate results. 
The Hp-n energies in aqueous solution are still significant. 
At this point the Hp-n interactions are like common 
hydrogen bond interaction, lees affected by solvation ef- 
fects. The Hp-n interaction acceptors (aromatic groups) 
are often the hydrophilic groups, which are buried in the 
hydrophobic core of protein structures. In the hydropho- 
bic pockets of proteins the solvent dielectric constants are 
small, and the Hp-n interactions may be still working well. 

Conclusion 

The polar hydrogen-n interactions (Hp-n) are a unique 
interaction type different from other main molecular 
interaction types in physical nature and properties. Some 
useful conclusions can be refined as follows. (1) In the 
20 natural amino acids 11 of them are involved in the 
Hp-n interactions, including Ser, Thr, Gin, Asn, Arg, 
Lys, Phe, Tyr, Trp, Cys, and His. In proteins the Hp-n 
interaction donors are the atomic groups -NH 2 , >NH, - 
OH, -SH, and C 6 H 5 OH; while the Hp-n interaction ac- 
ceptors are various aromatic and heteroatom aromatic 
groups in amino acid side chains. (2) The peptide bond 
units in protein backbones are quasi n-bonds, possessing 
both polar hydrogen atoms and conjugate n-groups, 
playing the roles of both Hp-n interaction acceptors and 
donors. (3) The Hp-n interactions are point to n-plane in- 
teractions, having many possible interaction conformations. 



(4) The Hp-n interaction energies between amino acids are 
in the range from -10 to -25 kj/mol, close or even larger 
than the common hydrogen bonds. (5) The bond length of 
Hp-n interactions are in the region from 2.30 to 3.00 A at 
the perpendicular direction to the n-plane, much longer 
than common hydrogen bonds. (6) Like common hydrogen 
bond interactions, the Hp-n interactions are less affected 
by solvation effects. (7) The common DFT method B3LYP 
fails in describing the Hp-n interactions. On the other 
hand, CCSD/6-311 + G(d,p) plus ghost atom H-Bq at bond 
middle can yield better results using less cpu-time, very 
close to the state-of-the-art method CCSD(T)/cc-pVTZ. 

Method and theory 

DFT method B3LYP has been widely used in the studies 
of organic molecules and biological molecules for many 
years because of its higher accuracy and less calculation 
workload. However, in recent decade the common DFT 
methods were found failing in description of molecular 
dispersion interactions, which are a main contribution in 
Hp-n interactions. On the other hand, more advanced 
quantum chemical configuration interaction (CI) [39-41] 
methods are able to evaluate the dispersion energies. 
However, such sophisticated methods are expensive, 
consuming much more CPU times and computer re- 
source than DFT methods do. Particularly the typical 
bond lengths of Hp-n interactions are around 2.5 A, 
much longer than other molecular interaction types, 
such as hydrogen bonds (-2.0 A). In order to calculate 
the long range Hp-n interactions accurately large basis 
sets have to be used. The large basis sets make the CI 
calculations of Hp-n interactions are even more expen- 
sive and CUP-time consuming. Careful selection of ap- 
propriable methods for the Hp-n interaction calculations 
is the first step of the Hp-n study. 

A comprehensive comparison was performed to evalu- 
ate the calculation results of several methods, including 
three methods (B3LYP, CCSD, and CCSD(T)) [42-45] and 
three basis sets (6-311 + G(d,p), TZVP, and cc-pVTZ) 
[46,47]. Small Hp-n interaction donors (CH 3 OH) and two 
acceptors (C 2 H 4 and C 6 H 6 ) are used in the comparison 
calculations. The Hp-n interaction energy of CH 3 OH- 
C 2 H 4 , one of the smallest Hp-n interaction pairs, 
is -11.853 kj/mol, more than half of the H 2 0-H 2 0 hydro- 
gen bond energy -21.258 kj/mol. The Hp-n interaction 
energy of CH 3 OH-C 6 H 6 is -19.765 kj/mol, very close to 
the common hydrogen bonds. In biological molecular in- 
teractions the Hp-n interactions are comparable to the 
hydrogen bond interactions. A remarkable difference be- 
tween Hp-n interactions and H-b interactions is that the 
hydrogen bonds are point to point interactions, and the 
Hp-n bonds are point to n-plane interactions. 

The Hp-n interaction structures and HOMO (highest 
occupied molecular orbital) of the Hp-n interaction pairs 
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are shows in Figure 1. In the interaction pair CH 3 OH- 
C 2 H 4 the polar hydrogen of CH 3 OH points to the 
double bond center of C 2 H 4 perpendicularly, and the 
polar hydrogen atom points to the carbon and the bond 
center of C 2 H 4 , respectively. In the HOMO figures of 
CH 3 OH-C 2 H 4 interaction pair the polar hydrogen atoms 
are in close touching with the n-orbitals of C 2 H 4 . At this 
point Hp-n interactions are similar to the hydrogen 
bond interactions. In the latter the polar hydrogen atoms 
are buried in the electron density of electronegative 
atoms, such as oxygen and nitrogen. 

The more advanced CI method CCSD (T) gives better 
results than that of the CCSD method. However, the 
cpu-time of CCSD (T) is much longer than that of the 
CCSD. Generally speaking the larger basis sets give the 
better results. In the calculations of Hp-n interactions 
polarization functions, diffuse functions, and floating 
functions are necessary. However, large basis set remark- 
ably increases the cpu-time of CCSD (T) calculations. In 
solving this problem a simple method is the use of 
'ghost atoms'. The ghost hydrogen atom H-Bq is an 
empty atom possessing the basis functions of hydrogen, 
but having no nucleus charge and electron [48,49]. In 
the CCSD calculation using 6-311 + G(d,p) basis set plus 
a hydrogen 'ghost atom' H-Bq yields the result 
(-11.715 kj/mol), very close to the result (-11.853 kj/mol) 
of the state-of-the-art method CCSD(T)/cc-pVTZ. How- 
ever, the cpu-time of CCSD calculation reduces to 1/8 of 
CCSD(T) calculation (5.3 hours to 44.6 hours). Another 
advantage of the use of ghost atom is reducing the basis set 
superposition error (BSSE) [48,49]. 

In the Hp-n interaction calculations by using DFT 
method B3LYP only take cpu-time few minutes. How- 
ever, the Hp-n interaction energies are around 20 ~ 30% 
smaller than that of other two CI methods (CCSD and 
CCSD (T)), because the common DFT methods fail in 
evaluating the dispersion energies, which is an important 
contribution in Hp-n interactions. In recent years the 
shortcoming of DFT methods has been improved by the 
better density functionals or using empirical correction 
for dispersion. 

In this study all calculations are performed using 
CCSD method and basis set 6-311 + G(d,p) + H-Bq, in 
which a 'ghost hydrogen atom' (H-Bq) is attached to the 
polar hydrogen atom, and the distance to polar hydrogen 
atom is 0.9 A. Keep in mind, in the calculations for Hp- 
tt interaction energies the same ghost atom is also added 
to the two molecule monomers. The Hp-n interaction 
energies in solutions are calculated using CCSD and 
PCM (Polarized Continuum Model) method [34-37]. All 
calculations are performed at Sugon-5000A computer 
and Tianhe-IA computer in National Supper Computing 
Center in Tianjin (China) using Gaussian 09 and Gauss 
View 5 software packages [50]. 
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